This article details a new plugin, llm-video-frames, that lets users feed video files into long-context vision LLMs (such as GPT-4.1) by converting them into a sequence of JPEG frames. It shows how to install and use the plugin, walks through examples with the Cleo video, and discusses the cost and technical details of the process. It also covers how the plugin itself was developed with an LLM and highlights other features in LLM 0.25.
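As a minimal sketch of the workflow described, assuming the `video-frames:` fragment syntax from the article (the file name `video.mp4` is a placeholder):

```sh
# Install the plugin into an existing LLM setup
llm install llm-video-frames

# Feed the video in as JPEG frames via the video-frames: fragment prefix;
# fps and timestamps are optional query-string parameters
llm -f 'video-frames:video.mp4?fps=1&timestamps=1' \
  'Describe the key scenes in this video' \
  -m gpt-4.1-mini
```

Higher `fps` values extract more frames (and cost more tokens); `timestamps=1` overlays a timestamp on each frame so the model can reference moments in the video.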
The LLM 0.17 release enables multi-modal input, allowing users to send images, audio, and video files to large language models such as GPT-4o, Llama, and Gemini, complete with a Python API and cost-effective pricing.
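For illustration, attachments on the command line use the `-a/--attachment` option introduced in 0.17 (file names here are placeholders, and the Gemini example assumes the llm-gemini plugin is installed):

```sh
# Attach an image to a prompt with LLM 0.17's -a/--attachment option
llm 'describe this image' -a photo.jpg -m gpt-4o

# Audio and video attachments work the same way with models that accept them
llm 'transcribe this audio' -a interview.mp3 -m gemini-1.5-flash-latest
```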
The author records a screen capture of their Gmail account and uses Google Gemini to extract numeric values from the video.
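The article does this through Google AI Studio rather than the CLI; a comparable run through the llm-gemini plugin might look like the following sketch (the model ID, prompt, and file name are illustrative, not taken from the article):

```sh
# Assumes: llm install llm-gemini, plus an API key via `llm keys set gemini`
llm -m gemini-1.5-flash-8b-latest \
  'Extract each numeric value shown in this video as a JSON list' \
  -a screen-recording.mov
```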